Small Sample Statistics for Classi cation Error Rates I : Error Rate Measurements

نویسندگان

  • J. Kent Martin
  • D. S. Hirschberg
چکیده

Several methods (independent subsamples, leave-one-out, cross-validation, and bootstrapping) have been proposed for estimating the error rates of classiers. The rationale behind the various estimators and the causes of the sometimes con BLOCKINicting claims regarding their bias and precision are explored in this paper. The biases and variances of each of the estimators are examined empirically. Cross-validation, 10-fold or greater, seems to be the best approach; the other methods are biased, have poorer precision, or are inconsistent. Though unbiased for linear discriminant classiers, the 632b bootstrap estimator is biased for nearest neighbors classiers, more so for single nearest neighbor than for three nearest neighbors. The 632b estimator is also biased for Cart-style decision trees. Weiss' loo* estimator is unbiased and has better precision than cross-validation for discriminant and nearest neighbors classiers, but its lack of bias and improved precision for those classiers do not carry over to decision trees for nominal attributes.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Small Sample Statistics for Classi cation Error Rates

Several techniques for estimating the reliability of estimated error rates and for estimating the signicance of observed dierences in error rates are explored in this paper. Textbook formulas which assume a large test set, i.e., a normal distribution, are commonly used to approximate the condence limits of error rates or as an approximate signicance test for comparing error rates. Expressions f...

متن کامل

Bias in land cover change estimates due to misregistration

Land cover change may be overestimated due to positional error in multi-temporal images. To assess the potential magnitude of this bias, we introduced random positional error to identical classiŽ ed images and then subtracted them. False land cover change ranged from less than 5% for a 5-class AVHRR classiŽ cation, to more than 33% for a 20-class Landsat TM classiŽ cation. The potential for fal...

متن کامل

Modified signed log-likelihood test for the coefficient of variation of an inverse Gaussian population

In this paper, we consider the problem of two sided hypothesis testing for the parameter of coefficient of variation of an inverse Gaussian population. An approach used here is the modified signed log-likelihood ratio (MSLR) method which is the modification of traditional signed log-likelihood ratio test. Previous works show that this proposed method has third-order accuracy whereas the traditi...

متن کامل

Double-bagging: combining classifiers by bootstrap aggregation

The combination of classi"ers leads to substantial reduction of misclassi"cation error in a wide range of applications and benchmark problems. We suggest using an out-of-bag sample for combining di0erent classi"ers. In our setup, a linear discriminant analysis is performed using the observations in the out-of-bag sample, and the corresponding discriminant variables computed for the observations...

متن کامل

Inference for Health Econometrics: Inference, Model Tests, Diagnostics, Multiple Tests, and Bootstrap

This paper presents a brief summary of classical statistical inference for many commonly-used regression model estimators that are asymptotically normally distributed. The paper covers Wald con…dence intervals and hypothesis tests based on robust standard errors; tests of model adequacy and model diagnostics; family-wise error rates and false discovery rates that control for multiple testing; a...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1996